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ABSTRACT 


Context. Determining distances to individual field stars is a necessary step towards mapping 
Galactic structure and determining spatial variations in the chemo-dynamical properties of stellar 
populations in the Milky Way. 

Aims. In order to provide stellar distance estimates for various spectroscopic surveys, we have 
developed a code that estimates distances to stars using measured spectroscopic and photometric 
quantities. We employ a Bayesian approach to build the probability distribution function over 
stellar evolutionary models given these data, delivering estimates of model parameters (including 
distances) for each star individually. Our method provides several alternative distance estimates 
for each star in the output, along with their associated uncertainties. This enables the use of our 
method even in the absence of some measurements. 
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Methods. The code was first tested on simulations, successfully recovering input distances to 
mock stars with <1% bias. We found the uncertainties scale with the uncertainties in the adopted 
spectro-photometric parameters. The method-intrinsic random distance uncertainties for typical 
spectroscopic survey measurements amount to around 10% for dwarf stars and 20% for giants, 
and are most sensitive to the quality of log g measurements. 

Results. The code was then validated by comparing our distance estimates to parallax measure¬ 
ments from the Hipparcos mission for nearby stars (< 300 pc), to asteroseismic distances of 
CoRoT red giant stars, and to known distances of well-studied open and globular clusters. The 
photometric data of these reference samples cover both optical and infrared wavelengths. The 
spectroscopic parameters are also based on spectra taken at various wavelengths, with varying 
spectral coverage and resolution: the Sloan Digital Sky Survey programs SEGUE and APOGEE, 
as well as various ESO instruments. 

Conclusions. The external comparisons confirm that our distances are subject to very small sys¬ 
tematic biases with respect to the fundamental Hipparcos scale (+0.4% for dwarfs, and -1-1.6% 
for giants). The typical random distance scatter is 18% for dwarfs, and 26% for giants. For the 
CoRoT-APOGEE sample, which spans Galactocentric distances of 4-14 kpc, the typical random 
distance scatter is =: 15%, both for the nearby and farther data. Our distances are systematically 
larger than the CoRoT ones by about +9%, which can mostly be attributed to the different choice 
of priors. The comparison to known distances of star clusters from SEGUE and APOGEE has 
led to significant systematic differences for many cluster stars, but with opposite signs, and with 
substantial scatter. Finally, we tested our distances against those previously determined for a 
high-quality sample of giant stars from the RAVE survey, again finding a small systematic trend 
of +5% and an rms scatter of 30%. Efforts are underway to provide our code to the community 
by running it on a public server. 

Key words. Stars: distances - fundamental parameters - statistics; Galaxy: stellar content 


1. Introduction 


A crucial step towards studying stellar populations and their variation across the Galaxy is to mea¬ 
sure reliable stellar distances. The parallax method is currently viable only for very local stars, 
although the recently launched Gaia satellite ( [Perryman et al.|[2001| l is expected to dramatically 
increase the number of parallax and proper motion measurements within a significant portion of 
the Milky Way. Precise distances may also be obtained from asteroseismology, but only for a frac¬ 
tion of stars subject to systematic variability studies ( |Miglio|2012[ Rodrigues et al.|2014j l. Indirect 
methods of distance determination, based on photometric and spectroscopic quantities and their 
relation to stellar absolute magnitudes, can be used for much more distant stars. Photometric dis¬ 
tances have been applied to multi-band optical and near infra-red data from different surveys to 
model the spatial distribution of Galactic stars and to study its substructures (e.g., jjuric et al.|2008| 
jCorrenti et al.|2010{ [Minniti et al.|20lT] l. Recent Sloan Digital Sky Survey III (SDSS-III, [Eisen-j 
[stein et JT]|201 l| l spectroscopic surveys, such as the Sloan Extension for Galactic Understanding 
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and Exploration (SEGUE, |Yanny et al.|2009[ ) and the Apache Point Observatory Galactic Evolution 
Experiment (APOGEE, [Allende Prieto et al.|200^ have produced many more observational con¬ 
straints, including stellar atmospheric parameters, kinematical and chemical data, that can be used 
to estimate reliable distances to tens of thousands of stars sampled from the SDSS and 2MASS 
( Skrutskie et al.|[2006| l photometric data. These surveys have already had a clear impact on our 
understanding of the Galaxy ( |Carollo et al.|2007 2010 Lee et al.|201 la[ Schlesinger et al.|2012 


|Carollo et al.|2012t|Cheng et al.|2012{|Bovy et al.|2012b|a[ [Anders et al.|2014[|Hayden et al.|2015| l. 

The recently completed RAdial Velocity Experiment (RAVE) survey ( [Steinmetz et al.p006) l has 
also provided atmospheric parameters, radial velocities and chemical abundances for six individual 
elements for more than 400 000 stars ( |Kordopatis et al.|2013| l. Ongoing spectroscopic surveys, such 
as the Gaia-ESO Survey (GES; [GilmorepOl^ , the Galactic Archaeology with HERMES survey 
(GALAH; Zucker et al. 2012| l, or the LAMOST Experiment for Galactic Understanding and Explo¬ 
ration (LEGUE; Deng et al.|2012[ ), are continuously increasing the number of stars with available 
spectroscopic information. In fact, these large spectro-photometric data sets will, when analysed 
together, allow a description of the structure and substructures of the Galaxy with unprecedented 
detail, excellent statistics, and full use of the 6D phase-space information. Phase-space reconstruc¬ 
tion of stellar distributions, coupled to spectroscopic abundances for large samples, will continue 
to provide challenging quantitative tests to models of Galaxy formation and evolution. 

In order to take full advantage of the large set of available spectroscopic and photometric pa¬ 
rameters from recent surveys, probabilistic inference has been used by several authors to infer 
ages, absolute magnitudes, extinction, distances, among other parameters (e.g., Pont & Eyer|2004[ 


|j0rgensen & Lindegren|[2005] |Bailer-Jones||201 1] |Serenelli et al.||2013[ [Schonrich & Bergemann| 

|2014[ ). In the context of stellar distances, a series of papers with increasing levels of sophistication 
has been presented by the RAVE collaboration ( |Breddels et al.|20T0{ [Zwitter et al.|2010{ |Bumett| 
|& Binney|2010| l. [Burnett & Binney| ( |2010| l make use of a comprehensive set of measured param¬ 
eters and their estimated uncertainties to infer, for each star, the probability distribution that a set 
of chosen stellar models generate the data. The method was further refined by [Burnett et al. (20111 


and Binney et al. (20141, and later used to study Galactic chemo-dynamics in, e.g. Boeche et al. 


( 2013a|b| l; Binney ( 2014| l or Kordopatis et al. ( 2015| l. 


In this paper, we follow a similar theoretical background as those authors, and implement a 
code that computes spectro-photometric distances with the goal of mapping large stellar samples 
in three dimensions or in phase-space. We are motivated by the analyses of SDSS-III SEGUE and 
APOGEE data being led by the Brazilian Participation Group (BPG) and the Leibniz-Institut fiir 
Astrophysik Potsdam (AIP), which are presented in Anders et al. ( 2014| l and Brauer (2015; PhD 
thesis, subm.). These papers use APOGEE giants and SEGUE G-dwarfs, respectively, to improve 
chemo-dynamical constraints to the Galactic components, especially the discs (see discussions in 
[Minchev et al.|[2014b| [Chiappini et al.|2015| l. Since SEGUE and APOGEE targets were selected 
based on different photometric data and have different spectral coverage and resolution, our basic 
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Table 1. Spectroscopic stellar surveys for which distances were computed using the method presented in this 
paper. For each case we provide basic information, such as duration, number of stars for different signal-to- 
noise ratios (SNR), spatial and spectral coverage, and spectral resolving power. 


Survey 

Year 

stars 

Spatial coverage 

Typical SNR 

d range 

R 

SEGUE (DR9 G dwarfs) 

2004-2008 

== 120,000 (35,000) 

1,300 deg^ 

25 (35) 

0.38 - 0.92/jm 

2,000 

APOGEE (DR 10 HQ giants) 

2011-2014 

100,000 (22,000) 

^ 2,800 deg2 

100 (120) 

1.5 - 1.7yum 

22,500 

RAVE (DR4 HQ giants) 

2006-2013 

== 500,000 (9,000) 

^ 20,000 deg^ 

30 (70) 

0.84 - 0.88/jm 

7,500 


challenge is to ensure that accurate distances are computed for datasets of vastly different prove¬ 
nance, in a homogeneous way. Our emphasis is therefore on confronting our distance estimates 
with as many reference samples as possible. A direct comparison with the RAVE distances ob¬ 
tained by Binney et al.| ( |2014) for the high-quality giant sample studied in Boeche et al. ( 2013a| l is 
also provided. In Table[T]we list the main characteristic of the surveys we have applied our method 
to. 


The outline of the paper is as follows. In Sec. we review the method, introduce our notation, 
and show the results of initial validation tests. An analysis of the performance of our code in terms 
of internal accuracy and precision is provided. We further discuss how biased stellar parameters and 
prior assumptions influence the estimated distances. In Sec.we compare our distances to several 
previous distance determinations that can be used as a reference, given their higher precision and 
more controlled systematics. We also compare our results for a high-quality sample of RAVE giant 
stars to those obtained by the RAVE collaboration. Our summary, conclusions, and future plans are 
provided in Sec.|^ 


2. The Method 


The general method adopted for this study uses a set of measured photometric and spectroscopic 
parameters, such as metallicity, [Ee/H], alpha element enhancement, [a/Ee], effective temperature, 
Teff, surface gravity, logg, intrinsic apparent magnitude, m, and colors, to estimate the distance to 
individual stars. These quantities are compared to predictions from stellar evolutionary models. The 
comparison between model and measured parameters follows a statistical approach that is similar 
to previous works ( Burnett & Binney|2010 Burnett et al.|2011[ Binney et al.|2014| l. 

In brief, assuming that the errors in the measured parameters follow a normal distribution, the 
probability that a measured value of some quantity, x + cr^., is consistent with some theoretical 
value, xo, is given by 


P(x, cr^\xo) = Afx„,< 7 -W = 


1 


( 


exp 


(x - Xq)^ 
2crl 


( 1 ) 


We can easily extend this reasoning to a set of (independent) measured parameters, x = {xi,..., x„), 
with associated Gaussian uncertainties, <Tx, whose theoretical values according to a given model 
are xq, by writing: 

P(x, CTxlxo) = Y\ ^xoj,o-i(xd (2) 
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where the product is taken over all the measured parameters of a single star being confronted to 
the model values. The expression above gives the likelihood of measuring the set {x, cr^) given a 
model xq. According to Bayes’s theorem, we may compute the posterior probability distribution 
(the probability of the model, given the data) as: 


P(Xo|x, crj 


F(x, o-x|xo) P(xo) 
F(X, o-x) 


( 3 ) 


where the numerator contains the likelihood and the model prior probability, and the denominator 
depends only on the measured parameters and their uncertanties[^ 

In order to evaluate the probability of some specific model quantity, 0 xq /, we consider the 
marginal posterior probability distribution for this quantity, which is obtained by integrating over 
all variables of Equation (3), except 0: 


p(0) := F(0lx, o-x) ^ J dxo,o-dxoj-idoj+i...dx„F(xolx,o-x) (4) 

As mentioned earlier, atypical set of measured parameters includes x = {[M/H], Teff, log g, colors, mj. 
As for the stellar models, besides the theoretical values of the same parameters, they also involve 
other quantities such as mass m*, age t, and absolute magnitude Mabs- And we take 0 - d, where d 
is the star’s model distance computed as d [pc] = _ 


2.1. Distance Uncertainties 

Our code can deliver various statistics for the desirable quantity. In this work, and following |Ro-| 
[drigues et al.| ( |2014| l, we compute a star’s distance d as the median of the marginalised posterior 
probability distribution, p(d) (Eq. 4). To estimate the uncertainties, we report the 68% and 95% 
upper and lower credible intervals of the median, t/esL, d^su, dgsi, and dgsu. When quoting a single 
uncertainty value, we use the definition 


cr(d) 0.5 ■ (d^su - d^gi). 

Whenever there is some ambiguity, we use the subscript “BPG” to denote our Bayesian distance 
estimate^ 


2.2. A test suite of simuiated stars 

In order to verify the method and our implementation, we use a set of simulated stars drawn from a 
grid of PARSEC 1.2S stellar models ( [Bressan et al.|201^|Tang et al.|2014 Chen et al.|2015| l with 
a narrow metallicity step of A[M/H] =0.1 and spanning the range -2.2 < [M/H] < 0.6, and age 
steps of logr[yr] = 0.05 in the range 7.5 < logT[yr] > 10.15. Specifically, we randomly select 


' Because we are only interested in inferring the best model parameter (in our case the distance) for a specific 
set of models, this term is dropped in further computations, as it merely represents a constant which can be 
normalised out (e.g. jlvezic et al.|2013| Chap. 5). 

^ BPG is short for the SDSS-lil Brazilian Participation Group 
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Simulated Stars: Stellar parameter distribution 
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Fig. 1. Stellar parameter distributions of the simulated PARSEC sample in the Tef! -log g diagram (without 
observational errors added). Upper left: solar-scaled metallicity, upper right: mass (in Mq), lower left: lumi¬ 
nosity (in Lg), lower right: age (in years). 


5000 models which fall into the parameter regime targeted by large-scale spectroscopic surveys, by 
restricting model space to 3000 < < 7000 K and 1 < logg < 5 (see Fig.[T]l. These simulated 

stars are assigned random distances between 0.1 and 6 kpc, drawn from a uniform distribution. 
The simulated photometry is in the 2MASS JHKs system ( |Cutri et ar]|2003| l, with no extinction 
applied. 

By adding typical (conservative) Gaussian random errors to the model observables, we then 
create two sets of spectrophotometric “observations” from this file: a “high-resolution-like” set of 
spectroscopically measured quantities - 100 K, etogg = 0.1 dex, e[z/H] =0.1 dexQ and 
a “low-resolution-like” set - 200 K, eiogg - 0.2 dex, e[z/H] = 0.15 dex). We assume that 
the errors are uncorrelated and the observational uncertainties are correctly estimated. Photometric 
errors are also assumed to be Gaussian and of the order of 0.01 mag. 

Some of the effects of possible deviations from these assumptions are discussed in the next 
Sections (stellar parameter biases, incomplete parameter sets, extinction. Galactic density priors). 


^ These values can be considered “typical” for stellar parameters obtained from high signal-to-noise-ratio 
(SNR) spectra from a multi-object high-resolution spectrograph like FLAMES/GIRAFFE or APOGEE. For 
very-high-resolution spectra from, e.g., HARPS or UVES, these values may even be lower by about a factor 
of 2. 
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Simulated Stars: Results 



j 4 a u / o o.a 'j a.a luu i z o 4 a u (uuu uuuu auuL 
m, U/.] logr [kpc] Tiff [K] 


0.50 


0.25 


0.00 







c 

“C 


I 


- 0.25 


a 


- 0.50 

0.5 

0.4 


o 



Fig. 2. Results from the test with the simulated PARSEC sample with “high-res.-like” observational errors 
(sr^g = lOOK, e[z/H] = 0.1 dex, eiogg - 0.1 dex, and = 0.01 mag.), using the full input parameter set 
(Set 1: X = j[M/H], Teff. logg. wt, colors)). Left six panels: Relative distance errors (observed - true) of the 
simulated stars shown as a function of the (true) main stellar model parameters T^s, [M/H], logg, mass m*, 
age T, and distance The solid line marks the identity line, the dashed lines indicate distance deviations 
of 20%. Upper right panel: Relative distance error distribution in the T^tf- log g diagram. Lower right panel: 
Relative distance uncertainty distribution in the 7'eff- log g diagram. Clearly, distances to giant stars are much 
more prone to systematics and have larger uncertianties. 


Other effects (e.g., [a/Fe]-enhanced isochrones, correlated and non-Gaussian stellar parameter er¬ 
rors) are beyond the scope of this paper, and deferred to a future investigation. 


2.3. Internal accuracy and precision 


Fig. 1^ summarises our results from the test with the simulated PARSEC sample with “high-res.- 
like” observational errors, when using the following set of measured quantities as an input; x = 
{[M/FI], 7’eff,logg,m,colors). In each panel except for the last, we show the relative deviations 
of our estimated values from the true distances, (c/bpg - '^true)/'^true. as a function of the model 
parameters. 

The first row of this Figure demonstrates that our distance errors are typically below 20% and 
do not show any strong systematics with the measured quantities Teff, [Z/H] and logg themselves. 
However, especially the upper right panel (distribution of distance errors in the Teff- log g diagram) 
reveals that our distances for main-sequence and subgiant stars are much better recovered than for 
giants (mean deviation -0.5%, root-mean-square scatter 10.6% vs. 1.8% and 23.3%, respectively). 
This is a known problem, because of the larger overlap of models with very different luminosities 
in the giant region of the Hertzsprung-Russell diagram (see Fig.[^. 

The second row of Fig. [^examines the dependency of our distance errors with the model param¬ 
eters which are not directly measured through spectroscopic observations: mass, age and distance. 
While there is no correlation between the distance errors and the true distances themselves, we do 
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Fig. 3. Distance accuracy and precision for the four input observable sets introduced in Sec. |2.4| Upper 


panels: Distribution of relative distance errors, (£/BPG-^^trije)/rftnje, for the sample of simulated stars drawn from 
PARSEC models (see Fig. [TJ with “high-resolution-like” observational errors {left, see Sec. \2.2\ and “low- 
resolution-like” errors (right). In each panel, we are using the distances based on the four sets of measured 
parameters listed in Sec. 2.4[ The results for dwarfs are shown in black, the results for the giants in grey. Solid 
lines’. Set 1 (x = {[M/H], reft, log g,m, colors)); dashed lines’. Set 2 (x = {[M/H], T^s, logg, m)); dash-dotted 
lines: Set 3 (x = {[M/H], logg, m, colors)); dotted lines: Set 4 (x = {[M/H], m, colors), with a cut at |A logg| < 
0.5 dex). Bottom panels: Distributions of our relative (internal) distance uncertainties, cr(dBPG)/dBPG. 


see some systematic trends with mass and age. In particular, our code tends to slightly overpre¬ 
dict the distances to very old, very-low-mass giant stars, while it underpredicts the distances for 
very young (r < 100 Myr) giants with super-solar masses. However, when taking into account the 
Galactic star-formation history and initial mass function (IMF), these groups typically represent 
a negligible minority of the stars targeted by large spectroscopic stellar surveys; their fraction is 
clearly overrepresented in our test sample. We can therefore be confident that our code delivers 
unbiased distance estimates for the vast majority of stars. 

Finally, in the lower right panel of Fig. we show the distance uncertainties for the simulated 
stars, again as a function of Teff and logg. Evidently, the internal precision of our method is much 
better for main-sequence stars than for giants (excluding the red-clump/horizontal-branch stars 
at logg ~ 2.5, for which we also find accurate and precise distances). A comparison between 
the upper and lower right panels of this Figure also shows that our code determines reliable and 
unbiased statistical uncertainties. 


2.4. Sensitivity to input observabies 


Initial tests showed that, among the input measured quantities, logg is most critical to estimate 
unbiased distances, since it is the quantity that best discriminates between low-luminosity dwarfs 
and more luminous giants, which otherwise share similar values of temperature, metallicities and 
colors. Dropping logg from the fitting procedure leads to overestimated distances for dwarf stars, 
and underestimated distances to giants, respectively. Similar tests also showed that cutting models 
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Simulated Stars: Effect of stellar parameter shifts 



Fig. 4. Effects of systematic shifts in the spectroscopically observed stellar parameters Teff (left), [Z/H] 
(middle) and logg (right panels) on our determined distances. For this exercise, we again use the simu¬ 
lated PARSEC sample with “high-res.-like” observational errors and the full input parameter set (Set 1: 
X = {[M/H],logg,m,colors)). In each panel, the errorbars show the mean and standard deviation of 
the relative distance errors, as a function of the value of a fixed shift in the particular stellar parameter. Dwarfs 
(log g > 4) and giants are displayed separately in the top and bottom rows, respectively. 


with |xo - x| > 3crx decreases the processing time without appreciably changing the final distance 
estimates. 

Based on these initial assessments, we implemented a code that estimates distances using the 
method described above and based on four distinct sets x: 

1. X = {[M/H], Teff, logg, m, colors) 

2. x = {[M/H],7’eff,logg,m) 

3. X = {[M/H],logg,m,colors) 

4. X = {[M/H), m, colors), but with a cut in models whose logg differ by more than 0.5 dex from 
the measured value. 


The code can accommodate other sets if so desired. The advantage of implementing different 
sets of parameters for distance determination is to provide flexibility: distances can be estimated 
even in the absence of one or two parameters from the set with the largest number of parameters 
(Set 1). Furthermore, the alternative distance estimates may be intercompared to provide a means 
to evaluate the sensitivity of the method to the particular combination of spectroscopic and photo¬ 
metric parameters adopted. 

Often, the data set in a given sample includes only [Fe/H) values, which are used in place of 
[M/H). When [a/Fe) values are available, we adopt the empirical relation [M/H) = [Fe/H)H-[a/Fe) 
for the comparison to PARSEC isochrones (e.g., [Anders et al.||2014| l. Another important issue is 
extinction: the photometric data, m and colors, must be corrected for extinction and redenning 
before comparing with the models. The exact procedure depends on the bandpasses used, on the 
available extinction estimates and on the assumed extinction coefficients. 

The top panels of Fig.j^show the results of applying the method to our simulated observations 
(see Sec. |2.2[ ). The distribution of the relative distance errors, ((iuPG - dxrne)!dxnie, are displayed for 
the four parameter sets listed above, for both the “high-res.-like” (left panel) and the “low-res.-like” 
simulations, and for giants and dwarfs separately. 

The four distance estimates proposed lead to reasonable results as attested by these error distri¬ 
butions. Surprisingly, there is only a very small systematic degradation in the precision of our dis- 


Article number, page 10 of 


30 




























Santiago, Brauer, Anders, Chiappini, Queiroz et al.: Spectro-photometric distances to stars 


tances as we move down from set 1 to 4. As noted above, set 1 in the “high-resolution” case yields 
mean distance errors of -0.5% with an rms scatter of 10.6% for main-sequence stars (logg > 4.0), 
while for giants the numbers are expectedly a bit worse (-1-1.8% bias and 23.3% precision). 

For the other three sets, these numbers are -0.2% + 11.3%, -0.7% + 13.1%, -2.9% + 21.5% 
for dwarfs, and h-1.7% + 24.3%, -1-1.8 + 23.4%, h- 2.8% + 25.1% for giants, respectively. This trend 
is expected, at least in the context of simulations, since usage of more measured parameters will 
tend to better constrain the stellar models that best describe each star. 

As can be seen in Fig. (top right panel), these numbers are slightly worse for the “low- 
resolution” case. The relevant numbers for this and all other comparison samples are listed in Table 
In the following, we will use only the full input parameter set (Set 1; x = {[M/H], Tetr, logg, m, colors)). 

The bottom panels of Fig.j^show the distributions of our distance uncertainties, cr{dBPG)/dBPG, 
for each of the four input sets, and for dwarfs and giants separately. Again, we also show the 
numbers for the “low-resolution” case. As expected, the uncertainty distributions for dwarfs peak 
at < 10%, while the distributions for giants are broader and peak at larger values. 


2.5. Effect of systematic stellar parameter errors 


In order to gain some insight into the robustness of our method to systematics in the stellar parame¬ 
ters used, we have studied the effect of constant shifts in each of the spectroscopic input parameters 
on our distance estimates. Fig. |^shows the results of this experiment; in each panel, all except for 
one input parameter are kept constant for the whole simulated “high-res.” sample, while the remain¬ 
ing parameter is shifted by a certain amount, and the effect on the relative distance error (observed 
- true) as a function of this shift. As before, we treat dwarfs and giants separately, because these 
two groups display distinct systematic trends. 


The most striking observation from Fig. is that even a substantial systematic shift in the 
temperature (lATefrl < 200 K) or metallicity scale (|A[Z/H]| < 0.2 dex) does not terribly affect 
neither accuracy nor precision of our distances. This is in stark contrast to systematic shifts in 
the surface gravity parameter (which are in fact quite commonly found even in high-resolution 
spectroscopic surveys - e.g., Holtzman et al.|20T5]l. 


2.6. The model priors 


In order to determine distances to stars in the Milky Way, we use our current knowledge about the 
Galaxy and its components to build the model prior probability distribution, P(xo). This includes 
basic information concerning the distributions of stars as a function of position, mass, age, and 
metallicity. 

In particular, we assume that all stars follow a Chabrier initial mass function ( |Chabrier|2003[ ) 
and incorporate the probability that a randomly selected star falls within a given mass range, p(m*). 
This involves an integral over the mass function which is provided by the model isochrones. 
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Reference Sample Comparisons 




Fig. 5. Relative distance residuals (estimated - reference) as a function of the main spectroscopic parameters 
Teff, [Z/H] and logg as well as the reference distances Top rows: Comparison to the astrometric distance 
scale provided hy the Hipparcos satellite, using the |Bensby et al.|f2014^ FGK dwarf sample (1st row) and the 
APOGEE-tfipparcoi giant sample (2nd row). 3rd row. APOGEE clusters. 4th row: CoRoT-APOGEE sample. 
5th row: SEGUE clusters. The central dashed line is the identity line, whereas the upper and lower ones 
indicate deviations at the 20% level. 


We take into account the effect of observing a given star in a small but bnite solid angle, by 
applying a geometric prior oc s^ds called the volume element, where s is the model distance. We 
further assume different age and spatial distributions for the basic Galactic components, namely the 
thin disc, the thick disc, and the spheroid. We here adopt the same Galactic structural parameters 
as Binney et al. ( 2014|l, and refer to this work for a detailed justihcation. 


The age distributions, p(t), are taken to be uniform with an upper limit of 10 Gyrs for the thin 
disc, and lower limits of 8 and 10 Gyrs for the thick disc and halo, respectively. 


The spatial priors for the thin and thick discs are given, respectively, by 


Ahin(^,Z) oc . exp(-/(//!«,thin) ■ exp(-|Z|/%thin), 
Pthick(f^,Z) oc . exp(-f;//!R,thick) ■ exp(-|Z|//!z,thick), 
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Table 2. Summary of the reference data: parameter ranges, uncertainties, and provenance, log g values in the 
CoRoT-APOGEE sample are based on asteroseismic measurements. All other non-photometric parameters 
were derived spectroscopically; the relevant references are given in the footnotes indicated in the third column. 


Sample 

dist. range [kpc] cr (7’ eff ) [K] range [K] 

o-logg log grange 

cr [Fe/H] 

[Fe/H] range photom. 

magnitude range 

Simulated sample 

0.1-6 

100 

3000 - 7000 

0.1 

1.0-5.0 

0.1 

-2.2-0.5 

JHK, 

distance-limited 

Hipparcos-Bensby 

<0.3* 

30- 100 2 

4800 - 7000 

0.1 

3.0-4.8 

0.1 

-2.5 - 0.5 

BjVt^ 

V <92 

Hipparcos-APOGEE 

<0.3* 

90 

3800 - 5500 

0.2 

1.0-4.0 

0.1 

~ -1.0-0.5 

JHK,^ 

{J-Kh >0.5;//<7 

CoRoT-APOGEE 

1-6^ 

100“* 

4000 - 5300 

0.05 

1.6-3.0 

0.1 

1 

p 

bo 

1 

p 

Ui 

JHK,^ 

(J-K)a >0.5;//< 13.8“* 

APOGEE Clusters 

1-6S-’ 

100** 

3700 - 5000 

0.2 

1.0-3.4 

0.1 

-1.0-0.5 

JHK, 

(J-K)o >0.5;//< 13.8'* 

SEGUE Clusters 

1 - 12'*-'* 

130 *° 

4700 - 7000 

0.21 

1.6-5.0 

0.11 

-3.0-0.0 

ugriz ** 

0.48 <(g- r)o < 0.55; ro < 20.2 *** 


Referenc es. (l)|van Leeuwen||2007} ; (2) |Bensby et al.H2014); (3)|H0g et al. (2000); (4) Anders et al. (2015) and references 


ther ein; (5)|Cutri et al.|(2003T; (6) WEBDA Database; (THHairis 1996); (8) Ahn et al.1(2014) ; (9) |Zasowsld et al.|(2013) ; 


(10) |Ahn et al. 1(2012) ; (1 l) |Fukugita et al.|(1996) ; (12) |Lee et al. 


(2008a) 


where {R, Z) are cylindrical coordinates with origin at the Galactic center, whose values are com¬ 
puted given the model distance s and a direction (I, b), and /ir and hz are the respective scale lenghts 
and heights. In computing the cylindrical coordinates, we use Rq = 8.33 kpc as the distance from 
the Sun to the Galactic Centre. 

For the halo, we assume a power-law density profile with spherical symmetry; 


Phaio(^ I, b) OC i^di ■ r 


where r{s,l,b) is a radial spherical coordinate. Again, as in |Binney et al.| ( [2014[ ), we adopt j - 
3.39. 


The three density profiles are normalised at the solar location, which, for simplicity, we take to 
be {R,z) - (Rq,Q) and r - Rq, respectively, in cylindrical and spherical coordinates. The normal¬ 
ization values are again from Binney et al. ( 2014| l. 

We assume that the three basic Galactic components follow Gaussian metallicity distribution 
functions (MDFs), /7,([Z/FI]), exactly as described by |Binne^ ( |2014| l. Our final model prior is then 
given as 


P(xo) = p(m*) ^ p,(t) pi{r) pi{[ZIH]). 


3. External Validation 


In this section, we present results of the application of our method described in the previous section 
to a number of reference samples for distance determination. These include distances computed 
from astrometric parallaxes obtained by ESA’s Hipparcos mission ( |Perryman|1989[|Perryman et al.| 
I Leeuwen|2007|l, precise asteroseismic distances produced by the CoRoT-APOGEE col- 


1997 


laboration for a set of solar-like oscillating red giant stars (Anders et al. 2015, subm.), and distances 
to well-studied open and globular clusters. As in the previous sections, we use dspc to refer to our 
distances in all figures. Those taken as reference are referred to as drei- In Table we provide 
information about the sources, derivation methods and precision of the parameters in the reference 
samples. The results of the comparison with the reference samples are summarised in Eig. |^and 
Table We also compare our distance estimates with those obtained by [Binney et al.| ( |2014| ) for 
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a sample of giant stars from the RAVE survey. This last comparison represents a cross-check of 
the implementation of the algorithm rather than a benchmark test, as the authors follow a similar 
theoretical background. 

Fig. [^presents the comparison of our spectro-photometric distances (BPG) to those from the 
reference samples. The first three panels of each row show our relative distance residuals with 
respect to the reference set, as a function of the main spectroscopic parameters [Z/H] and 
log g. The last panel of each shows the residuals as a function of the reference distance. Each row 
corresponds to a particular reference sample discussed in this Section. 


3.1. Comparison with the Hipparcos scaie - The FGK star sampie of \Bensby et a/.| (20141 


As a first validation test of our spectro-photometric distance algorithm, we use the high-resolution 
FGK dwarf sample of 714 solar-neighborhood stars with well-determined atmospheric parameters 
by |Bensby et al.| ( f2014) l. Hereafter, we will refer to this sample as the Hipparcos-Benshy sample, 
because for these stars precision parallaxes from Hipparcos ( |van Leeuwen||2007] l are available. 
We use the optical photometry from the Tycho-2 catalogue ( |H0g et al.|2000| l, and set mag = Vj, 
Mibs - Mvt colors = {Bj - Vt} in the set of parameters to be compared to the models. As 
throughout this paper, the models used for this comparison are from the PAdova TRieste Stellar 
Evolution Code (PARSEC 1.2S, Bressan et al.||2012{ Tang et aL]|2014 Chen et aL]|2015 1, with a 
narrow metallicity step of [M/H] = 0.1 and age steps of logT(yrs) = 0.05. 

The top row of Fig.j^shows the results of our distance code for the Hipparcos-Benshy sample. 
There is little or no systematics in the residuals with the parameters themselves. The mean and rms 
residuals over all stars are 0.4% and 18.2%. In fact, the observed rms residual is comparable to the 
expected errors in the parallaxes and in our distances based on the “high-res.” simulation (presented 
in Sec. |2.2| i combined together. Because the Hipparcos-Benshy sample combines high-resolution 
spectroscopic data for nearby dwarf stars with accurate parallaxes, it is arguably the best reference 
sample to test our code against. 

We have validated that, as in our simulation tests described in Sec. 2, the four sets of parameters 
yield acceptable distances. In the subsequent validation analyses in this paper, we will concentrate 
on the full parameter set { T^g, [Z/H], log g, colors), since it makes full use of the set of spectroscopic 
and photometric parameters which is common to the surveys to which we are applying our method. 


3.2. APOGEE 


The SDSS-III project APOGEE has acquired high-resolution near infra-red (NIR) spectra (R ^ 
22,500) of > 100,000 stars selected from the 2MASS Point Source Catalogue ( |Cutri et al.|2003| l, 
most of which are red giant stars located at low Galactic latitudes. APOGEE is providing a unique 
spectroscopic sample of disc-dominated stars with unprecedented volume coverage, for which pre¬ 
cision kinematical and abundance measurements are available. For more details about APOGEE 
data, we refer to |Ahn et al.| ( |2014] l; |Alam et al.|P015] l, and |Holtzman et ar]p015] l. Validation of 
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Table 3. Summary of the results for the reference distance samples used in this work. 


Sample 


Dist. Range [kpc] 

Mean rel. residuals [%] 

RMS rel. residuals [%] 

Simulated “high-res.” stars 

5000 

0.1-6 

-t-1.2 

20.9 

Dwarfs (log g > 4) 

1248 

0.1-6 

-0.5 

10.6 

Giants (log g < 4) 

3752 

0.1-6 

+ 1.8 

23.3 

Simulated “low-res.” stars 

5000 

0.1-6 

+3.3 

33.4 

Dwarfs (log g > 4) 

1248 

0.1-6 

-0.6 

15.5 

Giants (log g < 4) 

3752 

0.1-6 

+4.6 

37.4 

Hipparcos-Bensby dwarfs 

714 

< 0.2 

+0.4 

18.2 

Hipparcos-AFOGEE giants 

712 

< 0.3 

+ 1.6 

26.4 

CoRoT-APOGEE giants 

644 

1 - 12 

+8.7 

14.9 

APOGEE clusters - cluster age prior 

392 

1 - 15 

-16.5 

29.9 

SEGUE clusters - field priors 

419 

1 - 12 

+ 14 

43 

A log g =-1-0.25 dex; field priors 

425 

1 - 12 

-2 

35 

A log g =-1-0.25 dex; clus. age priors 

385 

1 - 12 

+ 13 

39 


APOGEE stellar parameters partly relies on calibrating star clusters with many probable member 
stars with well-determined parameters ( [Meszaros et al.|[2013[ |Holtzman et al.|2015| l. In this sec¬ 
tion we use two APOGEE subsamples to test our distances. In Sec. |3.2.2| we use a set of 678 
CoRoT stars in the exoplanet fields LRaOl and LRcOl (e.g., |MigIio et al.|2013| l which have been 
co-observed within an APOGEE ancillary campaign, and whose asteroseismic analysis has pro¬ 


vided high-precision surface gravities (Anders et al. 2015, subm.). In Sec. 3.2.3 we make use of 
the APOGEE calibration cluster dataset. 

In the case of APOGEE, we set m - K^, M^bs = and colors = {(7 - H), (H - /fj)), all of 
which are corrected for extinction before computing the distances. Eor the cluster stars, extinction 
values are based on the Rayleigh-Jeans Color Excess method (RJCE, [Majewski et al.|201 l||Nidever| 
et al.|2012 Zasowski et al.|20T3]l. This method is based on stellar colour excesses measured using 


fluxes in near and mid infra-red bands, where almost all stellar spectra are in the Rayleigh-Jeans 
regime and therefore have very similar intrinsic colours. Eor the stars in common with CoRoT, 
we use the precise extinctions calculated by Anders et al. (2015), using the method described in 
[Rodrigues et al.|p014[ ). 

We use the calibrated stellar parameters determined by the APOGEE Stellar Parameters and 
Chemical Abundances Pipeline (ASPCAP; Holtzman et al.|20T5 Garcia Perez et al. 2015, subm.). 
Some minor improvements on the DR12 calibrations of metallicity and surface gravity have been 
applied; they are described in AppendixjA] Since [cr/Pe] is again available, we assume that [M/H] = 
[Ee/H] -H [a/Ee] for APOGEE star^ 


Eor the comparison with the CoRoT sample (Sec. [3.2.^ , APOGEE surface gravities were re¬ 
placed by those from asteroseismic scaling relations (e.g., Miglio et al.|2013| l. Table [^summarises 
the data provenance. 


We here refer to ASPCAP’s cluster-calibrated [M/H] values ( jAlam et al.|2015|[Holtzman et al.|20i5t as 
[Fe/H], as ASPCAP metallicities were calibrated on literature [Fe/H]values. 
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3.2.1. The Hipparcos-APOGEE sample 

Through a fiber feed from the New Mexico State University Im telescope at Apache Point Observa¬ 
tory to the APOGEE spectrograph, it is also possible to use APOGEE to observe smaller numbers 
of bright stars, one at a time ( |Holtzman et al.|[2015| Sec. 2.3). Apart from calibration, this setup 
is also being used within an ancillary campaign to observe several hundred red giant stars with 
measured Hipparcos parallaxes (Eeuillet et al, in prep.). The data processing and stellar parameter 
determination works in exactly the same way as for the data taken with the SDSS 2.5m telescope. 

We have used the DR12 Hipparcos-AY’OGEE sample of 712 stars with precise parallaxes 
{(j{n)ln < 10%) to fundamentally validate our stellar distance estimates also for giant stars. The 
second row of Eig. [^shows our results. The mean and rms residuals amount to h- 1.6% and 26.4%, 
respectively. 

Recalling the tests performed in Sec.|^ we find that these numbers are very much comparable 
with the trends found for the giant sample of simulated “high-resolution” stars (see Table [^. Tak¬ 
ing further into account that the Hipparcos parallaxes for these relatively distant objects are also 
affected by uncertainties of order 10%, the results indeed supersede expectations. Furthermore, we 
find no systematic trends with either of the stellar parameters. 


3.2.2. APOGEE - comparison with asteroseismology: the CoRoT-APOGEE sample 


Another important test for our distance method is the comparison with asteroseismically derived 
distances. It has been shown by recent studies (e.g., |Miglio|2012t|Silva Aguirre et al.|2012||2013[ ) 
that stellar distances (or equivalently, radii) determined from CoRoT and Kepler lightcurves, either 
via asteroseismic scaling relations or by comparing asteroseismic parameters to predicted values 
from a grid of models, agree within a few percent with Hipparcos parallaxes and eclipsing bi¬ 
nary data. For example, Silva Aguirre et al. ( 2012| l show that distances can be derived with 5% 
precision for solar-like stars by coupling the infra-red flux method to determine Tetj and bolomet- 
ric fluxes to the grid-modelling of the measured values of the frequency of maximum oscillation 
power, Vmax, and of the frequency difference between dominant oscillation modes, Av. We there¬ 
fore compared our spectro-photometric distances based on APOGEE stellar parameters with the 
distances obtained from CoRoT data for 678 stars in the fields LRaOl and LRcOl, which have been 
successfully observed by both instruments. The reference distance scale is that of Anders et al. 
(2015) who used the well-tested stellar parameter estimation code PARAM ( |da Silva et al.|2006) 
Rodrigues et al.|2014|l, which is based on a similar theoretical background as this paper, but differs 


in a number of details. PARAM utilises the parameter set {Teff, [Z/H], Av, Vmax) to first estimate the 
posterior probability over the stellar models (i.e., it delivers mass, radius, age and absolute mag¬ 
nitude pdfs). In a second separate step, it compares the derived absolute magnitude pdfs with a 
number of broad-band photometric measurements to derive individual stellar distances and extinc¬ 
tions. The code has been extensively tested in [Rodrigues et al.1 ( |20l4| l and Anders et al. (2015); it 
delivers very precise distances for asteroseismic targets icr(d) < 2%). Because we use the same 
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Table 4. Summary of the properties of star clusters used as reference for APOGEE and SEGUE distance 
estimates: dereddened cluster distance moduli, ages, metallicities, reddenings, and numbers of stars observed 
by APOGEE and SEGUE. Almost all of the listed clusters were used for APOGEE calibration, while SEGUE 
targeted a larger number of cluster members in some selected calibration clusters. 


Cluster 

(m - M)o 

log(T) 

[Fe/H] 

E(B-V) 

^APO 

N 

SEGUE 

NGC 188 

11.55 

9.63 

-0.02 

0.082 

5 


NGC 2158 

13.52 

9.02 

-0.23 

0.36 

15 

10 

NGC 2420 

12.45 

9.05 

-0.26 

0.029 

16 

125 

NGC 5466 

16.02 

10.13 

-1.98 

0.0 

5 


NGC 6791 

13.06 

9.64 

-rO.15 

0.117 

22 

37 

NGC 6819 

11.86 

9.17 

+0.07 

0.238 

23 


NGC 7789 

11.84 

9.23 

-0.08 

0.217 

4 


M2 

15.50 

10.11 

-1.65 

0.06 

3 

64 

M3 

15.07 

9.69 

-1.5 

0.12 

18 

35 

M5 

14.46 

10.02 

-1.29 

0.03 

84 


M 13 

14.33 

10.06 

-1.53 

0.02 

46 

149 

M 15 

15.39 

10.07 

-2.37 

0.10 

31 

78 

M35 

10.37 

7.97 

-0.16 

0.262 


6 

M53 

16.32 

10.10 

-2.1 

0.02 

13 


M67 

9.79 

9.41 

0.00 

0.059 

48 

50 

M71 

13.01 

10.00 

-0.78 

0.25 

2 


M92 

14.65 

10.14 

-2.31 

0.02 

34 

39 

M 107 

15.05 

10.14 

-1.02 

0.33 

39 


Be 29 

15.86 

9.025 

-0.44 

0.157 

9 



data and similar methods to derive stellar distances, the distance scale of Anders et al. (2015) does 
not provide a completely independent benchmark. However, it serves as an important cross-check 
for the implementation of our code. 

The comparison is shown in the 4th row of Fig.We again show the relative distances residuals 
between our distances (BPG) and those from CoRoT-APOGEE. The general picture is reassuring, 
despite the limited number statistics. Our code converged for 644 stars, the mean and rms relative 
distance residuals are -1-8.7% and 14.9%, respectively. No strong trends are seen with metallicity, 
effective temperature or surface gravity. 

The systematic differences between the two methods may arise from the different handling of 
priors, while the rms scatter is more likely to be attributed to our use of only three photometric pass- 
bands instead of the multi-wavelength photometry used by PARAM, and to minor differences in 
building the distance pdf. Eor simplicity, and because of the high quality of acquired data, Anders et 
al. (2015) opted to use an IMP prior plus flat priors on Galactic structure, to derive distances. How¬ 
ever, their sample extends over a large range of Galactocentric distances, so that small systematic 
shifts like the one measured here are not excluded. 

In summary, we find a very good agreement between the two methods, modulo a small system¬ 
atic trend which we attribute to our choice of more comprehensive priors, and a scatter of ~ 15% 
which could be reduced by the inclusion of more photometric observations in our dataset. 


3.2.3. APOGEE - Cluster comparison 


In the third row of Pig. we compare our spectro-photometric distances with those obtained from 
isochrone fitting of star cluster color-magnitude diagrams (CMDs). We again restrict the com- 
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parison to the spectro-photometric distances based on the parameter set (1) in Sec. We use a 
subsample of the ~ 400 open and globular cluster stars that are used for calibration of ASPCAP, 
as described in Meszaros et al. ( |2013| l; Holtzman et al. (20151. We again refer to Table for a 
summary of the data used. Most of the stars with reliable parameters belong to the globular clusters 
M5, M13, M15, M92, M107, and to the open cluster M 67. The cluster distances and ages are 


adopted from the WEB DA cluster database in the case of open clusters and Harris (19961 for the 
globular clusters. Table [^summarises the cluster properties. 

The right panel in this comparison shows a hint that our distances are being underestimated 
for more distant clusters. The global mean residual is -16.5%, with an rms scatter of 30%. All 
clusters, except for NGC 2420 (t/ref - 3 kpc), have most of their stars with residuals smaller than 
30%. Instead of the field star priors discussed in Sec. |2.6[ we have used a simple age prior for each 
cluster. This adopted cluster prior simply assumes a lognormal distribution of cluster stellar ages, 
whose mean is the literature age estimate for the cluster and the dispersion is taken as d log r = 
0.1. We find that for most APOGEE clusters this prior is irrelevant, but the age prior improves the 
agreement with the reference distances for NGC 2420 and NGC 2158, while it negatively affects 
M71, which is the oldest and most metal poor cluster in the list. 

The comparison suggests that our spectro-photometric method, despite the scatter, yields dis¬ 
tances that are in general agreement with the cluster scale, but subject to significant systematics, 
especially as a function metallicity (see Eig. [^ third row, second panel). The fact that we do not 
see these systematic trends in the comparisons with the Hipparcos scale suggests that the cluster 
distances might be subject to improvements in the underlying stellar physics (e.g., the inclusion of 
Brandt & Huang|2015b I, and less of a gold standard than commonly assumed. Another 


rotation; 


important caveat is the possible contamination of the cluster sample with non-members. 

Regarding the uncertainty of isochrone-based cluster distances, it has been shown by |Pinson7| 
[neault et al.| ( |2000| l that open cluster distances (which are usually determined via measuring the 
main-sequence shift relative to the Hyades cluster) may be subject to zero-point shifts, based on 
changes in the adopted distance to the Hyades and interstellar reddening, as well as to the metal and 
helium abundance, and the age, of the Hyades (e.g. Brandt & Huang|2015a| l. However, assuming a 
conservative error of 0.1 mag for the cluster distance moduli, the uncertainties in the spectroscopic 
distances are still by far the most important. 


3.3. SEGUE Clusters 

The Sloan Extension for Galactic Understanding and Exploration (SEGUE) is a large optical spec¬ 
troscopic survey at low resolution (R ^ 2,000, Smee et al.|2013[ ). Its goal is to deepen our knowl¬ 
edge about the Galactic structural components and their stellar content, sampling them mostly at 
high latitudes ( |Yanny et al.|2009| |Eisenstein et al.|2011[ ). In this sense, SEGUE is largely comple¬ 
mentary to APOGEE in terms of coverage of the Galactic components. Eor more info on the spectra 
and instruments we refer to |Gunn et al.| ( |200^ and |Smee et al.| ( |2()13| l. SEGUE data have been pro- 
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Fig. 6. Comparison of our distances and the host cluster distances ^cluster for a subsample of clusters with 
SEGUE observations. For each cluster the median distance and the 68% quantiles (as error bars) are shown 
assuming the following different scenarios: open diamonds - uncalibrated SSPP DR9 parameters with field 
priors, black pentagons - a logg shift of 0.25 dex and field priors, and red circles - a shifted logg and a simple 
cluster age prior. For visibility the different symbols are slightly offset with respect to ^cluster- The lower panel 
shows the distance residuals, similarly to the right column of panels in Fig.[^ 


cessed through the SEGUE Stellar Parameter Pipeline (SSPP), which is described, along with its 
improvements, in a series of papers ( |Lee et ar]|2008a|b{ |Allende Prieto et 'ar]|2008b| |Smolinski| 
|et al.|20fT| |Lee et aL]|201 lb[ ). Particularly important to the SSPP is the validation of the derived 
stellar parameters. Eield stars with high-resolution spectra and known members of well studied star 
clusters have been used for that purpose ( |Allende Prieto et al.|2008b^ |Lee et al.|2008b[ [Smolinslo] 
|etal.|2011| ). 


As in the case of APOGEE (see Sec. |3.2.3| l, we here use the sample of cluster stars, whose 
distances are well-known from isochrone fitting, to further test our distance estimates. In the case 
of SEGUE data, we set mag = g, Mabs = Mg, and colors = {(m - g), (g - r), {r - i), (i - z)) (see 
Gunn et al. (19981 for a description of the SDSS camera and Eukugita et al. ( 1996|l for informa¬ 


tion on the photometric system). The spectroscopic and photometric data are from the SDSS Ninth 
Data Release (DR9; |Ahn et al.|2012| l database. Since [cr/Ee] is available for all SEGUE stars in 
our reference clusters, we assume that [M/H] = [Ee/H] -H [a/Ee] in order to compare the data to 
the model [M/H] values. As previously mentioned, all measured quantities have associated uncer¬ 
tainties. The measured photometric quantities also must be corrected for extinction and reddening 
to allow a direct comparison to the models. Again, the data for the clusters used here are listed in 
Table H 


The original sample contained 11 clusters, among open and globular, totalling a bit over 1000 
stars. Eor 593 stars, a complete set of spectroscopic and photometric parameters and associated 
uncertainties allowed us the use of parameter set (1) to estimate distances. As the sample is spec¬ 
troscopic, in all other cases the missing parameter(s) was(were) photometric. Distances were suc¬ 
cessfully computed (i.e., at least one model was found within 3 cr of all the measured quantities) 
for 425 out of the 593 stars, pertaining to the eight clusters listed in the table. We include only 
those clusters with at least five confirmed members and successfully derived distances using our 
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Fig. 7. Left panel: The contours show the density of SEGUE dwarfs in the logg vi. [Ee/H] plane. The colour 
density map corresponds to the same type of stars simulated by the TRILEGAL code. Right panel: same as in 
the previous panel, but stars are shown in the log g vi. Teft plane. 


approach. Their distances, ages and metallicities were again taken from the WEBDA database in 
the case of open clusters, and from|Han^(|1996|) in the case of the globular clusters. 


In the last row of Fig. we show the distance residuals of individual stars against [Fe/H], 
Teff, logg, and reference cluster difference, similarly to what we did with the other samples in this 
Section. The scatter is larger than in previous samples, attesting the lower resolution of the data. 
Still, most of the stars have residuals close to or within the 20% lines and do not exibit any strong 
systematics with the spectroscopic parameters. The more deviant points correspond to a fraction 
of the low metallicity ([Fe/H] < -1.0) giant and subgiant (logg < 4.0) stars, located beyond 
6 kpc. The mean and rms residuals over all stars are 0.14 + 0.43 after applying a 3-cr clipping. 
Our individual distances are biased towards large values for a signihcant fraction of the stars in the 
more distant globular clusters. This trend is opposite to what was observed in the case of APOGEE 
clusters (see Sec. 3.2.3| l. If we consider only the 3 clusters with t/jef < 5 kpc, the mean and rms 
scatter are reduced to 0.03 + 0.27. 


We investigated further the cause of this systematic overestimate of the distances to part of 
the SEGUE stars in the more distant clusters. In analyzing the distribution of different SEGUE 
spectroscopic parameters, it became clear that the logg parameter for our sample, as published 
in DR9, appears to be underestimated for most stars in those clusters. Recalling that logg is the 
most important quantity to efficiently separate dwarf and giant stellar models, we have decided to 
heuristically correct for this shift in this work. 


The contours (black) in Fig.|7]show the distribution of a large sample of SEGUE G-dwarf stars 
(sample compiled from DR9 and described and analysed in Brauer 2015 (PhD thesis, subm.)) in the 
logg vs. Teff (left panel) and logg vs. [Fe/H] (right panel) plane. For comparison, we also present 
the density distributions for a mock sample of G-dwarf stars obtained from TRILEGAL simulations 
( Girardi et al.|2005' 2012| l covering the same SEGUE plates as present in the SEGUE G-dwarf data 
sample. These simulations were carried out using ADDSTAR, a web based tool that uses parallel 
processing to efficiently run TRILEGAL in many independent pointings ( jBalbinot et al.|2012) l. The 
TRILEGAL G-dwarf mock sample shown here has been compiled by applying the same overall 
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cuts to the simulation that were used for the SEGUE G-dwarf data sample including the colour- 
magnitude limits for SEGUE G-dwarfs. Eor a detailed description of the TRILEGAL mock sample 
see Brauer (2015). 

A clear shift is seen in both panels, in the sense that the data have systematically lower values 
of log g for a fixed T^ff or metallicity by ^ 0.25 dex. The comparion with the population synthesis 
model reveals that the SSPP log g values published in DR9 are underestimated, and that shifting 
the log g values for the SEGUE stars is clearly needed, at least for the purpose of deriving reliable 
distances. Eurther details about the TRILEGAL mock G-dwarf sample, the discrepancies seen 
in stellar parameters between data and simulation, and the underestimated surface gravity values 
published with DR9 are given in Brauer (2015). 

We thus decided to test the effect of applying a shift of 0.25 dex to the logg values when 
computing the distances. The results are shown in Eig. where we compare the median spectro- 
photometric distances of each cluster with the reference cluster distances for different log g cali¬ 
brations and/or priors used. The open diamonds correspond to median cluster distances using the 
SEGUE data as they come from DR9. The black pentagons show the effect of shifting logg as 
explained above. The global mean offset relative to the reference distances has been reduced to 
-2.4%, although the rms relative residual remains high (35%). Eor the three more nearby clus¬ 
ters, the distances become systematically underestimated by -18% although the scatter around the 
mean residual has decreased considerably to 12%. The result of adopting a simple age prior for the 
clusters, as done for APOGEE data, is shown as the red circles of Eig. This choice has a sizable 
effect on the distances in the sense of increasing them for the more distant clusters, yielding a mean 
and rms relative residual of -1-0.13 ± 0.39. 

We conclude that, despite the large scatter in distances of individual cluster stars, there is a 
general agreement between our median distances for SEGUE cluster stars and those from CMD 
htting. A fraction of the stars at lower metallicities and logg, and located in the more distant 
globular clusters, have their distances overestimated by our method. This seems to be because the 
SEGUE logg scale is systematically too low, as attested by simulated samples of SEGUE stars. 

Einally, we emphasize that the SEGUE data is very distinct from APOGEE in terms of spectral 
range and resolution. These data also probes much larger distances than the Hipparcos based data 
used in Sec. 13.II 


3.4. RAVE 


RAVE collected medium-resolution (R ~ 7,500) Call triplet spectra of > 400,000 stars with 
9 < / < 13. Eor > 200,000 of these stars, Burnett et al.|(|20lT]l have been able to derive spectro- 


photometric distances using spectroscopic parameters from the RAVE pipeline (Kordopatis et al. 
|2011[ ) as well as near infra-red photometry from 2MASS ( [Skrutskie et al.|2006[|Cutri et al.|200^ . 
Their method has provided the background for the work presented in this paper, which is largely 
based on the Bayesian approach proposed by |Burnett et al.| ( |2011| l. More recently, jBinney et al.| 

Article number, page 21 of 30 

















Santiago, Brauer, Anders, Chiappini, Queiroz et al.: Spectro-photometric distances to stars 


Comparison to the RAVE DR4 distances 
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Fig. 8. Comparison of our distance estimates against the RAVE DR4 distances from Binney et al. 1 2014 1 for 
the high-SNR sample of |Boeche et al.|{2013a^ . In each panel the vertical axis shows the absolute distance 
residuals normalised to the combined uncertainty from both methods, = -\/<^(^bpg)^ + cr(tlRAVEp. as 

a function of the main spectroscopic parameters [Z/H] and logg as well as the RAVE distance, c/rave- 


(20141 presented an improved version of the method. In this section, we use their distances for a 


high S/N sample containing > 9000 giant stars ( [Boeche et al.||2013a) l to cross-check the imple¬ 
mentation of our method (details in Appendix]^. The spectroscopic parameters are taken from the 
RAVE Data Release 4 (DR4; [R)rdopatis et al.|2013] l, the photometry is from 2MASS. 

In order to derive distances for RAVE stars, we set m = Ks, M^bs - Mk,, and colors = 
{(7 - H),(H - K)}. We adopt the Ay extinction values that have been computed for individual 
RAVE objects by |Binney et al. (2014 1 to deredden the NIR magnitudes of the stars. The conversion 
between the extinction in the optical and near-IR wavelength range is performed following |Rieke & 
Lebofsky'] ( |1985 i. We adopted the [Ee/H] and [a/Ee] estimates from the chemical RAVE pipeline 
( [Boeche et al.||20lT| [Kordopatis et al.|[2013| l to make our metallicity scale more compatible with 
the stellar models, as in the previous sections. 

Despite the use of a very similar theoretical framework, we note that there are some differences 
between the way distances are estimated by [Binney et al.j ( [2014| l and in this paper. The Padova 
isochrone grid these authors used ( [Bertelli et al.[200^ is not the same as the one used for this paper, 
and it was restricted to [Ee/H] > -1. Also, they use a slightly different IME ([Aumer & Binney 


20091, and apparently they did not use the volume element s'^ds in the density priors presented in 


Sec. 2.6 They also apply a kinematic correction for their distances ( Schonrich et al.|2012[ ). Erom 
the data side, we have used a slightly different metallicity scale, namely that of the RAVE chemical 
pipeline instead of the RAVE stellar parameter pipeline, together with a different handling of a- 
enhancement. 

Eig. 1^ shows the comparison of our distance estimates with the distances from [Binney et al.| 
( [2014[ ) for the giant sample analysed in [Boeche et al.[ ( |2()13a[ l. Eor compatibility with our approach 
described in Sec. we use the (s) distance estimate from the calibrated DR4 catalogue from 
Binney et al. ( 2014| l in this comparison, which is the expectation value computed from their model 
pdf over distances instead of their corresponding parallaxes. Because neither of the distances being 
compared are supposed to be at the same level of precision as those in our reference samples, we 
change the way we compare them. Instead of computing the relative residual with respect to any 
one of the estimates, we normalise the residual by the combined uncertainty from both, which 
are added in quadrature. Our distance scale seems to be slightly compressed relative to RAVE 
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RAVE Stars: Effect of adopted priors 



Fig. 9. The effect of the adopted priors (Sec. |2.6| on our spectro-photometric distances (BPG), illustrated 
for the case of the RAVE sample. In each panel, we show the relative distance residuals (priors - no priors) 
with respect to the case where no priors were used, as a function of the main stellar parameters TeB, [Z/H], 
logg, and as a function of position in the Galaxy (right panels). The top row shows the results for distances 
computed using the IMF prior and a flat prior in age instead of the logarithmic one implicit in the spacing of 
isochrones in log t. The middle row also includes the spatial density priors. Finally, the bottom row includes 
the MDF and age priors for the different Galactic components. 


DR4: we have systematically larger (smaller) distances than RAVE for stars with £/ravedr4 < 1 
('^RAVEDR 4 > 1) kpc. The effect, however, is small. The vast majority of the stars have distance 
residuals well acommodated by the expected uncertainties in the two estimates being compared. 
The mean and rms normalised residuals are 0.00 + 0.33, after a 3-cr clipping to eliminate the very 
few strongly deviant points between the two methods. Looking separately at the two distances 
regimes, we have mean relative residuals of 6% and -16%, respectively for £/ravedr4 < 1 and 
£^ravedr4 > 1 kpc. 


3.5. Effect of different priors 


In Fig. we assess the effect of the adopted model priors on our distance estimates. For this 
purpose, we use the high quality RAVE sample (:^ 9000 stars) presented in Sec. |3.4| 

The aim of this exercise is merely to be able to quantify the effect of the adopted priors on a 
particular stellar population, not the justification of these priors (which is discussed in Sec. |2.6| l. 
Note that for the internal consistency checks discussed in Secs. |2.3| |2.4| and |2.5| we did not use 
any priors, which is the appropriate decision for our simulation, but not for real Milky Way stellar 
populations. 

The top row panels in Fig. compare distances based on no assumed priors, as used in the 
simulations, to those using only the IMF prior, as presented in Sec. |2.6[ and a correction for a 
uniform age distribution. The distances in the middle panels include the spatial priors, but still 
exclude the specific age distributions and MDFs of individual Galactic components presented in 
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Sec. |2.6| The comparison with the distances that also take these later into account is left to the 
bottom panels in the figure. 

It is clear from the figure that inclusion of age and metallicity priors both tend to systematically 
reduce the distances of a significant fraction of the stars. Restricting the thick disc and halo samples 
to old ages, in particular, prevents models of young luminous stars to be included in their distance 
estimates. Similarly, associating disc stars to relatively metal-rich models also tends to reduce their 
distances. A large number of stars are unaffected by the specific choice of model priors. Using all 
stars with estimated distances i < 6 kpc (when all priors are included, comprising the vast majority 
of the RAVE high quality sample) we obtain a mean residual of 5% and a scatter of 6% between 
the all priors and spatial priors only distances. This is, in fact, smaller than the systematic residuals 
and scatter found with respect to more precisely determined distances, as discussed in Sec.[^ 


4. Summary and conclusions 


We have implemented a code to estimate distances to field stars based on their measured spectro¬ 


scopic and photometric parameters. The theoretical background is very similar to that of Burnett 


|& Binne^ ( |2010| l and subsequent papers from the same authors. Briefly, given a set of measured 
quantities, these are compared to model values and, using a Bayesian approach, the posterior prob¬ 
ability distribution function is derived for each star. The distance is then determined as the median 
value of the posterior pdf marginalised over the other parameters. 

We used simulated stars from PARSEC models as a first validation test of our distances. We 
showed that distances based on 4 different sets of parameters are well recovered, without significant 
systematic biases, and statistical uncertainties that scale with the uncertainties of the input param¬ 
eters. Distances to dwarf stars are more accurately recovered, with <1% mean residual and 10% 
rms errors for a “high-resolution” simulated sample; typical errors for giant stars are about twice 
those values. The method is most sensitive to logg, and starts breaking down if this parameter is 
removed from the data set, also yielding strong systematic distance errors as a function of bias in 
this parameter. 

Our simulations also show that the estimated distance uncertainties are in general agreement 
with the errors, with the uncertainty distribution peaking at ^ 10% 20%) for dwarf (giant) 

stars in a mock high-resolution sample, reaching significantly larger values when spectroscopic 
parameters are less precise. 

We demonstrated that distances of nearby stars with parallaxes from Hipparcos and precise 
atmospheric parameters are successfully recovered, with only very small systematic trends, and 
random errors consistent with the expected combination of errors in the parallaxes and in our 
distances. The relative random errors are of the order of 20% or less, with <1% systematic errors, 
for nearby FGK dwarf stars. For giants with Hipparcos parallaxes, the random and systematic 
residuals in our distance estimates are also as expected based on our mock samples. 
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For more distant stars, we have validated the code using giant stars with asterosismic oberva- 
tions, as well as certain members of well-studied star clusters. In the later case, we compared our 
distances to the cluster distances taken from the literature. 

Our distances for giant stars in common between APOGEE and CoRoT typically agree within 
15% of each other. A systematic overestimate in our distances in comparison to those inferred from 
asteroseismology at the level of ^ 9% may result from using a more set of comprehensive Galactic 
priors and a less comprehensive set of photometric measurements than our reference. 

We used star clusters included in the SDSS-III SEGUE and APOGEE samples for a final set 
of validation tests, this time covering a larger range in distances, out to ^ 12 kpc. The distance 
scatter among individual cluster stars is larger than the distance residuals for the more nearby stars 
with parallax measurements. Eor APOGEE clusters, there is a small trend with distance itself, in 
the sense of underestimating the distances to the more distant cluster stars (d > 6 kpc). 

In the case of SEGUE, scatter tends to be higher than APOGEE, consistent with the lower 
resolution of the spectra. In particular, for a fraction of the low-metallicity giant and subgiant stars 
that predominate in the more distant globular clusters used, our distances tend to be overestimated, 
contrary to the trend seen in APOGEE. We have identified the probable origin for this discrepancy 
as a systematic bias in the SEGUE logg values. Applying a shift of 0.25 dex to the SEGUE logg 
reduces the mean distance residual over individual stars from 14% to -2% when all clusters are 
considered. The scatter is also significantly reduced for the more nearby cluster SEGUE stars. 
Despite the large scatter observed for a subset of the SEGUE stars, the inferred average distances 
to the SEGUE clusters are in good agreement with those quoted in the literature. 


We have also compared our distances to those for a sample of high-quality RAVE stars, deter¬ 
mined with similar approaches to the one presented here ( |Binney et al.||2014l l. Despite the large 
scatter and a slight systematic trend, our distances for the same sample of red giant stars studied in 
Boeche et al. (|2013a[) agree, within the expected uncertainties, with those derived by|Binney et al. 


( |20T4l ). 


The validation results are all summarised in Table[^ where we show the dataset used, the num¬ 
ber of stars and the range of reference distances in each, and the mean and rms relative residuals. 
The table allows a global evaluation of the distances, specially when confronted to those from as¬ 
trometry, asteroseismology, and well calibrated star clusters. The systematic residuals are usually 
less than 9% while the scatter scales with the quality of the input spectro-photometric parameters, 
with the most distant SEGUE clusters being our worst case. 


The code presented and validated in this paper is being used by the Brazilian Participation 
Group on SDSS-III to derive distances for tens of thousands of stars belonging to the SEGUE and 
APOGEE surveys. These surveys are complementary in many ways, including the directions and 
Galactic components they probe more efficiently. When analysed together, provided that reliable 
distances are available, they allow a 3D mapping of Galactic structure using different stellar tracers, 
determination of chemo-kinematic relations in the Galaxy, and ultimately, confrontation of these 
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constraints with models of Galactic formation and evolution ( [Minchev et al.|20T3l|2014al l. Initial 
science analyses based on these distances are provided by |Junquerra et al.| ( |2015| l and [Anders et al.| 
( |20T4| ). 
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Appendix A: APOGEE distances and caiibration of steiiar parameters 


For the computation of distances to APOGEE stars, we make use of the ASPCAP stellar parameters 
bundled in the twelth data release of the Sloan Digital Sky Survey (SDSS DR12; |Alam et al.|2015| l. 
The APOGEE data products contained in this release are described in detail in [Holtzman et al.| 
( |2015| ). While we generally use the stellar parameter calibrations provided by these authors, we 
have adopted slightly different calibrations for metallicity and surface gravity, which are explained 
below. 

Our APOGEE distances will be released within an SDSS DR12 value-added APOGEE distance 
catalogue. 

Surface gravity for red-clump [Holtzman et al.| ( |20l5] l use the very accurate and precise 
surface gravities from the AVOGEE-Kepler sample ( jPinsonneault et al.|2014| ) to calibrate out sys¬ 
tematic shifts in the surface gravities delivered by ASPCAP. They demonstrate (their Pig. 4) that 
these systematics are different for red-giant branch (RGB) and core-He burning red-clump stars 
(RC). The offset between both groups is about 0.2 dex. However, the authors only provide a sur¬ 
face gravity calibration for RGB (their formula 3), and for RC stars defer the reader to the APOGEE 


RC catalogue described by Bovy et al. (20141 who provide very precise distances to these stars. 


Because we wanted to test our distance code on all APOGEE giant stars, we have separately ht the 
calibration relation for the stars contained in the RC catalogue: 


loggRc = loggcaiib - (7.21 ■ 10-^ ■ (7’efF[K] - 4400) + 0.129) 
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Metallicities in the super-solar regime: ASPCAP metallicities have been calibrated using open 
and globular cluster metallicities from the literature. Fig. 6 of |Holtzman et al.] ( |2015| ) shows the 
result of the ’’external" metallicity calibration: in the metal-poor regime, the raw ASPCAP metal¬ 
licities are up to 0.3 dex too high compared to the literature, while the values are similar at solar 
metallicities. The authors have therefore opted to fit a second-order polynomial to the data (their 
formula 6). But because there are very few calibration clusters above solar metallicity, the quadratic 
calibration relation may be overfitting the data at the metal-rich end, and therefore overestimate 
metallicities in this regime. We have thus opted to use the calibrated values only when [Z/H]< 0.0, 
and to otherwise use the uncalibrated values. 


Appendix B: SEGUE distances 


For the computation of SEGUE distances, we use the results of the SEGUE Stellar Parameter 
Pipeline (SSPP; |Lee et al.|2008a|b|[2011a[[Stnolinski et al.|20lT] l for a clean sample of G dwarfs 
(details will be provided in Brauer 2015). Prom SDSS DR8 to DR9, it underwent a couple of 
modifications providing improved stellar parameters for all SEGUE objects within the scope of 
DR9. In particular, there are some obvious systematic shifts in the individual parameters when 
comparing DR8 and DR9 SSPP values for the same objects. 

The most significant deviation is observed for the surface gravity showing a clear overall de¬ 
crease of the parameter by 0.2 dex to 0.3 dex to lower values. This finding is in agreement with the 
fact that the DR9 surface gravity is in general lower by about ~ 0.2 dex (Rockosi et al. 2015, in 
prep.). Possible explanations for the decrease are the following: (1) the surface gravity estimates 
from the re-analysis of the high-resolution spectra that are used to calibrate the DR9 SSPP results 
are in general lower by 0.13 dex than those from the older high-resolution analysis, (2) for some 
of the logg estimation methods utilised by the SSPP, new synthetic spectral grids were used, and 
(3) the line index methods MgH and Cal 2 were not considered any longer - which may have had 
a substantial effect since those estimators produced higher logg values compared to others. The 
overall lower gravity is certainly caused by a combination of the above facts. 

Besides the shift in surface gravity, the temperatures in DR9 tend to be systematically larger 
than in DR8 by ~ 50 K. This temperature shift occurs due to the IRPM temperature scale being 
utilised to calibrate the SSPP DR9 temperature estimators. 

While the metallicity scale did not change significantly, the [a/Ee] abundances shifted to higher 
values by about 0.1 dex. 

logg calibration: As shown in Sec.|^ accurate absolute surface gravity values are crucial for 
computing spectro-photometric distances. Any systematic under- or overestimation of the surface 
gravity will lead to significant under- or overestimation of our distances. Hence, as justified in Sec. 
|3.3[ we decided to calibrate the SEGUE DR9 surface gravity values by applying a shift of 0.25 dex 
and thus increasing the logg of each object in the G-dwarf sample by the same amount. Apart from 
this modification, we use the SEGUE DR9 values delivered by the SSPP. 
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Appendix C: RAVE distances 


In this work, we have used the RAVE giant sample assembled by |Boeche et al.| ( |2013a[ ). It comprises 
stars with the highest quality spectra and abundances. The following quality criteria have been 
applied; (1) only spectra with a signal-to-noise ratio of at least 60 were selected on which the 
RAVE pipeline converged to a single point of the parameter space (Elag Algo_Conv = 0), (2) the 
chemical pipeline converged < 1000), (3) the number of defective pixels along the spectrum 
is small (£rac > 0.99), and (4) every object that is not classified as a normal star according to the 
morphological classification described in Matijevic et al. (2012|l was excluded. Like Binney et al. 


(20141, we use the stellar parameters from DR4 ( |Kordopatis et al. 2013| l. The applied cuts to select 
RAVE giants are 0.5 dex < logg < 3.5 dex and 4000 K < Teff < 5500 K. This ensures that only 
cool giants are selected and problems with the grid limits of the automated pipeline are avoided by 
setting the lower limit in logg. The resulting RAVE giant sample comprises 9098 stars. 

We will also provide distances for the full RAVE DR4 sample in the near future. 
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